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ABSTRACT 


A new type of vector multiplication defines a group 
(whose elements are vectors) isomorphic to the group of 
rotations. This allows a vector representation of rotations 
which has many advantages over the usual matrix or Euler - 
ian angles approach— e.g., this vector representation avoids 
the need for trigonometric relationships and requires only 
three independent parameters. Several applications show 
the simplicity of this vector representation— in particular, 
an analytic solution to a least -squares rotation problem. 
The differential equations defining the motion of a rigid 
body are obtained in terms of a vector differential equation. 
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A VECTOR APPROACH TO THE ALGEBRA 
OF ROTATIONS WITH APPLICATIONS 

by 

Paul B. Davenport 

Goddard Space Flight Center 

INTRODUCTION 

The algebra of rotations is generally approached through matrix algebra, where the matrix of 
the rotation is defined by the product of simple rotation matrices. Each of the simple rotations is 
about a coordinate axis and is uniquely determined by an angle. The most general rotation is uni- 
quely defined by three angles (Eulerian angles), provided that the axis of each rotation is known. 
The convention of defining Eulerian angles varies considerably in the literature, so that, when only 
three angles are known, the product matrix is ambiguous. Furthermore, the angular or matrix ap- 
proach to rotations requires the evaluation of trigonometric functions and therefore the use of 
tables or a computer. The approach here is to define a group (whose elements are three-dimensional 
vectors) isomorphic to the group of rotation matrices. The algebra of rotations can then be repre- 
sented by the algebra of vectors (dot and cross product plus a new vector product which will be 
defined later), and does not require the evaluation of trigonometric functions. 

A comment on notation: matrices and vectors are denoted by capital English letters and their 
elements by small English letters with subscripts. All vectors are considered as column vectors; 
a superscript T denotes the transpose of a vector or matrix. A superscript -1 denotes the inverse 
matrix. Primes denote the transform of a vector. Vector notation is used merely to simplify the 
algebraic relationships existing among the components of various vectors, and is not intended to 
suggest a physical interpretation. Even though two vectors may represent the same physical 
quantity, they are not regarded as equal unless they are expressed in the same coordinate system. 
Similarly, the notation u VxW may be used (even though V and W are expressed in different co- 
ordinate systems) merely to indicate that the components of u are formed from the components of 
V and W according to the standard equations for cross products. It will be assumed throughout that 
all coordinate systems are right-handed orthonormal systems. 

VECTORS DEFINING ROTATIONS AND THEIR ALGEBRA 

If R is the matrix of a rotation, then References 1 and 2 show that R may be expressed by 
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where X = (x ( . x 2 , x 3 ) T is a unit vector defining the axis of rotation and 8 is the angle of rotation. 
Except when otherwise stated, it is assumed that x has been selected so that 0 <0 < 77 ; this is always 
possible, since R x (0) = R_ x (-8) and R x (v + 8) = R _ x (-n-8). 

The following expression for R may also be found in Reference 1: 


R = (b 7 )'^, 


(2) 


where 


B 




(3 'f 


In order to eliminate the angle in Equations 1 and 3, we assign a length (which is a function of 
6) to X, in such a way that the sine and cosine of 8 can be determined by this length. Many such 
functions exist; we choose tan 9 / 2 and sin 6 / 2 as examples. Thus, let 


6 6 

Y - tan 2 X ’ Z - sin 2 X , Where 0 < 6 < rr ; 

by standard trigonometric identities we obtain: 
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Hence the definition of r as given by Equation 1, using y and z, alternatively, becomes 
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If Bis expressed as a function of y, Equation 3 becomes 

* 



( 6 ) 


Thus, every vector y defines a rotation matrix by Equation 4, and every vector z with z 2 < l defines 
a rotation matrix by Equation 5. Hence Equations 4 and 5 define mappings that map sets of vectors 
into the set of rotations. We have yet to verify that the mapping is "onto/ 1 i.e., that for every ro- 
tation matrix there exist vectors y and Z such that Equations 4 and 5 define the matrix of the rotation. 


Let R be the matrix (with elements r . . ) of a rotation. Separating R into symmetric and skew- 
symmetric parts gives 


R = \ (R+R t ) + y (R-R t ) . 


Comparing this with Equation 5 gives the following relations: 

3 - 4Z 2 = or , 


(where a- is the trace of r) 



^ Z 2 Z 3 2 ( F 3 2 + F 2 3 ) ’ 

2z , Z 3 = \ ( r !3 +f 3l) ’ 

2Z 1 Z 2 = \ ( r 21 +r i 2 ) ' 


(7) 


( 8 ) 
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1_2 ( Z 2 2+Z 3 2 ) r ,,' 

1 ‘ 2 ( Z l 2 + Z 3 2 ) r 22’ 

1-2( z , 2 ^z 2 2 ) r 33 . 

These last three equations imply 

z i r 1 1 r 22 ~ r 33)/^ ' 

z 2 = l 1 + r 22 “ r n ~ r 33)/ 4 ’ 
Z 3 2 = (1 + f 33 - r , 1 “ r 22)/ 4 • 


(10) 


Thus, to express R in the form of Equation 5 demands that z satisfy nine conditions; by using the 
well-known properties of rotation matrices, it is easy to verify that these nine conditions are 
consistent. 


Equations 7 and 8 give a convenient expression for z: 


provided that z 2 ? l. If z 2 1 , then Equations 9 and 10 may be used to determine z. In this case, 
however, z is not unique; if it satisfies Equations 9 and 10, so does -z. In practice, this ambiguity 
is of no consequence; because, if Z 2 - 1 , z and -z yield the same matrix. 

From Equation 1, we find that a - 1 + 2 cos 0, which by Equation 11 implies that Z 2 1 1. 

Hence the mapping defined by Equation 5 is a mapping of three-dimensional vectors over the 
field of real numbers whose length is less than or equal to unity (denote this set of vectors by c) 
onto the group of rotation matrices. The mapping is also one-to-one except when Z 2 T 1 ( ; ~ ~1). 

The vector Y may be obtained, like the vector z, directly from the matrix R or indirectly from 
Z itself, by the relationship 



Y 


1 

ii - z 2 


z . 


( 12 ) 
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Thus, 



The vector Y is undefined when cr - -i. This singularity may be removed if we agree to allow vec- 
tors of infinite magnitude whose direction is given by a unit vector, X, say. For such vectors, 
Equation 4 becomes 

R -I + 2XX t . (14) 

With this convention, Y may be obtained from the rotation matrix R with trace -1, if it is noted that 
X Z in this special case. Hence, x may be obtained from Equations 9 and 10. 

Let Tj denote the set of all real three-dimensional vectors augmented by the vectors of infinite 
magnitude discussed above. Then Equations 4 and 14 define a mapping from r, onto the group of 
rotation matrices. Thus, either of the two sets 4 or v may be used to parametrize the group of 
rotations. However, when vectors are used for this purpose it should be emphasized that they cor- 
respond to transformations. Hence, it is their algebraic properties as transformations that con- 
cern us and not their properties as vectors (indeed, 4 is not even a vector space). It does not mat- 
ter that two elements of l or tj may be equal in the sense of equality of transformations but not 
equal in the normal sense of vector equality. On the other hand, the abundance of vector opera- 
tions (and their algebraic properties) that are normally employed to simplify relationships between 
components of vectors can also be applied to vectors corresponding to a rotation. What makes the 
vector parametrization appealing is the fact that the algebra of vectors as rotations can be ex- 
pressed in terms of standard vector notation. In order to distinguish vectors used to parametrize 
rotations from ordinary vectors transformed by rotations, we introduce a new type of vector, which 
is obtained by defining a new equivalence relation among real three-dimensional vectors. Definition 
Given a set h of real three-dimensional vectors and a mapping r that maps S onto the group of ro- 
tation matrices; then two elements of 8 are said to be equivalent if they map into the same rotation 
matrix. For the two sets 4 and r } defined above, vector equivalence is the same as vector equality 
except for vectors mapping into rotation matrices with trace of -l. 

Clearly, vector equivalence as defined above is an equivalence relation and separates the sets 
4 and into disjoint classes. These equivalence classes are merely the inverse images under r of 
the rotation matrices. We shall call these equivalence classes "rotation vectors" and denote them 
in the same manner as ordinary vectors. The algebra usually associated with vectors is also ap- 
plicable to rotation vectors. The well-known operations of scalar multiplication, vector addition, 
and dot and cross products are performed on rotation vectors in the classical manner, and the 
ordinary symbolism is used to denote these operation. When there is more than one vector in the 
equivalence class, the desired result of an operation can be that obtained by using either vector. 
However, the same vector must be used throughout any one expression. 
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Given two rotation vectors Y ^ and y 2 , Equation 4 defines two rotation matrices, say Rj and r 2 , 
respectively. It is well known that R = r 2 r, is also the matrix of a rotation. Thus, by a previous 
discussion there exists a rotation vector Y that defines R. This vector can be obtained from Equa- 
tion 13 by forming the product r 2 r, in terms of Y 1 and Y 2 , which gives the remarkably simple 
expression 


Y = iTi] • Y 2 ( y , +Y 2 +y i * y 2 ) (15) 

If z, and z 2 are members of £ defining Rj and R 2 , respectively, then the ZeC defining r, such 
that R = R 2 Rj, may be obtained in a similar manner, i.e., by comparing the trace and skew- 
symmetric part of r 2 Rj with the trace a and skew -symmetric part of r, respectively, as given by 
Equation 5. This gives 


o- = 3 - 4Z 2 =• 3 - 4Z 0 2 , 


Vi - z 2 z 


(vi-z , 2 VT-z^-z, • z 2 )z 0 , 


where 


= ii -z„ 


‘ Z l 


z. + z. 


Thus 


i - z 2 = 1 - z 0 2 = [Vi-z , 2 ii -z 2 2 - z l • z 2 ] 2 , 


and 


n -z , 2 Vi - z 2 2 - z, • z 2 

■■■ = + 1 , 

Vi -z 2 


where the sign depends only on the numerator, since /l - Z 2 > 0 by definition, i.e., Vi - z 2 = cos 0/2 
where 0 <6 <tt. Hence, using the signum function, sgn(x), which is defined as +1, -l according as 
x is positive or negative, respectively (here zero is considered positive or negative), leads to 

Z = [sgn(Vl-Zj 2 Vl-Z 2 2 -Zj • Z 2 )jz 0 . (16) 

The above argument does not guarantee that the z as given by Equation 16 corresponds to R = R 2 Rj 
when Z 2 = l. However, it is straightforward though tedious to verify that Equation 16 does indeed 
give the z defining R = R 2 R t for all z x and Z 2 contained in £ corresponding to R t and R 2 , respectively, 


6 



Actually, Equation 16 may be obtained more readily by combining Equations 12 and 15 and the 
identity 


z 


vr^ Y> 


(17) 


which is a useful formula since it is also valid in the limit as Y 2 approaches infinity. 

Equations 15 and 16 suggest a new type of vector product. Definition: Let 8 be a set of vec- 
tors, let * be a binary operation on S, and let r be a mapping of 8 onto the group of rotation ma- 
trices. Then * is said to be a rotation product if it is preserved by r, i.e., if r(V *W) = t(V)t(W), 
lor every v and W in 8 . Thus for Z,, Z 2 in i, the product Z = Z 2 * z, given by Equation 16 is a ro- 
tation product. In a similar vein, for Y,, Y 2 in 77, Equation 15 defines a rotation product, except 
when Yj ■ Y 2 = l, or when either of the rotation vectors has infinite magnitude. In these exceptional 
cases, we define the product Y 2 * Y, by forming z x and Z 2 by Equation 17, Z 2 * Z, by Equation 16, and 
then Y 2 *Yj by Equation 12. 

Let v ' and £' denote the two sets of rotation vectors (equivalence classes) defined by vector 
equivalence and the sets 77 and £, respectively. Then each of the two sets 77 and £ , together with 
its rotation product, forms a group. Clearly, 77' is closed under the rotation product. Also, since 
Equation 17 implies that z = Z 2 *Z X has length less than or equal to unity, £' is closed. The as- 
sociativity of the rotation product is easily verified. The identity of each set is the null rotation 
vector, and the inverse of V (in either set) is -V. Furthermore, the respective mappings of the two 
sets defined herein are isomorphisms onto the group of rotation matrices. Thus I (the identity ro- 
tation matrix) is the image of the null rotation vector; R 1 is the image of -v if R is the image of V. 
Actually, a one-to-one relationship between vectors and rotations may be defined without the con- 
cept of equivalence class. For example, when a - - 1 , additional conditions may be imposed on z 
to ensure uniqueness. The choice of additional constraints, however, depends on the preference of 
the user (a situation analogous to the many definitions of Eulerian angles). Thus, we introduced 
vector equivalence to emphasize that for the purpose of representing a rotation it does not matter 
which vector one selects out of an equivalence class for a 180 -degree rotation. 

The group of rotation transformations can now be represented by either of the two groups, £' 
or r,\ This representation has certain advantages over the usual matrix representation, stemming 
from the fact that the rotation is defined by three independent parameters, without recourse to trig- 
onometry (the matrix approach requires nine elements; or the evaluation of six trigonometric 
functions plus two matrix multiplications). In many applications the vector representation requires 
fewer calculations than the matrix representation. For example, matrix multiplication requires 27 
scalar multiplications plus 18 additions; the rotation product of Equation 15 requires only 13 multi- 
plications and 10 additions. 

The choice of vector representation depends on the application. The formulas associated with 
the Y rotation vector are generally quite simple, requiring no radicals or trigonometric functions. 
Thus this vector representation is a valuable tool for hand calculations or for deriving theoretical 
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results, but has the inconvenience of becoming infinite for all 180-degree rotations. The z-vector 
representation, on the other hand, is always finite, always defined (uniquely, except for its sign at 
180-degree rotations), and the rotation product is valid for all rotations; but the existence of the 
radical is inconvenient for hand calculations. The vector representation w - tan CV4X may prove 
to be useful since it combines some of the assets of both the y and z. In this case we have 



COORDINATES OF A ROTATED VECTOR 

A rotation matrix is frequently used to find the coordinates of a vector after rotation. The 
vector representation yields a useful formula for this application. Let V' be the image of V under 
the rotation and R the matrix of the rotation (i.e., v' = rv). If Y and Z are the rotation vectors de- 
fining R, then Equations 4, 5, and 14 give V' as 

r 

i'TyT [(l - Y 2 ) V + 2(V • Y) Y + 2V x y] , Y 2 < ® 

V' r YV = < 

-V + 2(V • X) X Y 2 = x 

V 

V' = ZV = (l - 2Z 2 ) V + 2(V • Z) Z 1 2 Vl - Z 2 V x Z . 

Note that the symbolism YV denotes the image of V under the rotation corresponding to Y. Thus the 
above equations define a vector multiplication in the same sense that RV defines a matrix 
multiplication. 

ROTATIONS DETERMINED BY A VECTOR AND ITS IMAGE 

In many instances we are given two vectors (v and V') and wish to determine the matrix R such 
that v' = rv. If R is to be a rotation matrix, then we must have V' 2 = v 2 . For simplicity, we as- 
sume that v' 2 = v 2 - l. The rotation taking V into v' is not unique; however, the vector representa- 
tion of the "shortest path" rotation is immediately obvious— the axis of rotation is collinear with 


8 



V'x V and the angle of rotation is cos 1 V • v' . Thus 


l 

1 + V * V' 


V' X V , 


V' t '1 


V2( 1 + V * V 7 ) 


= V' X V , V ■ V' t -1 . 


If v • v' = -l, then Z is any vector satisfying the two conditions Z 2 - 1 and Z * V 0; Y has infinite 
magnitude with direction defined by z. 


ROTATIONS DETERMINED BY TWO VECTORS AND THEIR IMAGES 

A common practice in orbit theory is to construct a rotation that takes the xy -plane into the 
plane of the orbit and takes the x-axis into the direction of perigee. This is a particular example 
of the following problem: Given v„ V 2 (v, and V 2 noncollinear), v,', and v 2 ‘ such that v / 2 = v, 2 , 

, and Vj' * V j = Vj * V 2 , find the rotation that takes into V t ' and V 2 into V 2 \ Here again, 


> * 2 - 


t + * - 

the vector representation of the rotation yields a simple solution. 


If v' r RV, then Equation 2 implies that B T V' 
tion, this condition can be written as 


BV. From Equation 6 and by matrix multiplica- 


V'+YxV' - V - Y xV, 


or 


v - v' = Y x (V + v' ) , 
where Y is the vector representation of R. 


Thus, the conditions v/ = RV, and v 2 * = RV 2 may be expressed by 

V, - V,* = Y x ( Vj + v,') and V 2 - v 2 * = yx(v 2 + v 2 '). 


Let 


A i = V t - V/ , Bj = V, + Vj' , ( i = 1,2). 

Then the condition equations become 


A i = Y x B i , (i = 1, 2) , 
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where A, • B t - 0 and A, • b 2 - -\ 2 • b,. It is immediately obvious that Y • = o. If each side of 

the first equation is cross -multiplied by the vector A 2 , we obtain 

A 2 X A 1 = A 2 X (YxB,) = (A 2 • B,) Y - (Aj • Y)b, ^ ( A 2 • Bj Y ; 
thus, if A 2 • Bj = -Aj -b 2 / o, we obtain a simple expression for Y, namely, 


Y 



(Vi'V) 


( V 2-V 2 ‘) ■ 


(18) 


A more general expression may be obtained as follows. Cross -multiply each side of the ith equa 
tion by b £ ; this gives 


Bj X Aj = B/Y - (B. ■ y) B, , (1 - 1. 2) . 

Therefore, 

B 2 • (b, x Aj) = Bj 2 B 2 • Y - (Bj • B 2 ) B, ■ Y, 
B, • (B 2 xA 2 ) = B 2 2 B, ' Y - ( Bl B 2 )B 2 • Y, 
and solving these two linear equations for Bj ■ Y and B 2 • Y yields 

(B j x B 2 ) • [b, 2 A 2 -( Bi -B 2 ) A,] 

(B,xB 2 ) • [(B, -B 2 ) A 2 -B 2 2 Ai ] 

(b,~b 2 ) 2 _ 


B l ' Y 


B 2 • Y 


(19) 


Substitution of these last expressions in Equation 19 gives the solution for Y, provided that B, x b 2 /o. 

If V,’ = V, and/or v 2 ' = V 2 , the z rotation vector is obtained from y by Equations 17 and 19 un- 
less Bj x b 2 = 0; otherwise, Equations 17 and 18 provide a much simpler expression for z, namely, 


sgn(V, V 2 '-V 2 - V/) 


VK -v a '-v 2 - v;) 2 + [(v.-v/) x (V 2 -V 2 ')] ; 


( V i-V) * ( v 2 - v 2 ') 


When v.' = v. and B,x B 2 = 0, then z = V.. The above expressions provide another way to make a 
vector representation of a rotation from the matrix of the rotation r: choose two independent vec- 
tors (Vj and v 2 ) and set v,' = RVj, v 2 ' = RV 2 . 
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Given v,, v 2 , v,‘ and such that v, 2 = V/ 2 , v 2 2 = v 2 ' 2 and V, • v 2 = V/ ■ v 2 ', the matrix of the 
rotation, R, that takes v, into V,' and v 2 into v 2 can be determined from the vector representation 
obtained above and Equation 4 or 5. A more direct approach is to construct two right-handed or- 
thonormal coordinate systems; one from vectors V 2 and v 2 , and the other from vectors v 2 and v 2 . 
The rotation taking the first system into the second will then take v ; into v/. The matrix of this 
rotation can be easily written as the product of two matrices, each of whose rows or columns are 
formed from the components of the constructed axes relative to some underlying coordinate system 


Ui = VV 




- Vj'/iv; 



< 

< 

1 


r . i v i • v) .. “ 

w = 

v - V 

v 2 V 2 v i 

v 1 

, W' = 

V - V' 2 V * 

1 


u 2 = w/lwl , 




w'/lw'l , 


u 3 = u, x u 2 , 

R, = (U!.U 2 .U 3 ). 


U 3 ' = U,' X u 2 ' , 

= K’ u 2'- u 3') : 


then 


R 


R 2 Rj 


A LEAST-SQUARES ROTATION-DETERMINATION OF ATTITUDE* 

Consider a vehicle with a local coordinate system that has been rotated from a fixed coordin- 
ate system. If the vehicle has equipment that can find the direction (relative to the local coordinate 
system) of a point whose direction relative to the fixed coordinate system is known, then the rota- 
tion relating the local and fixed systems can be determined by two or more observations of such 
points, t If the directions were exact, then the rotation relating the two coordinate systems could 
be obtained from any two noncollinear observations as described in the previous section. In prac- 
tice, however, the directions are not exact and in such cases we generally seek the "least-squares" 
solution. 

For the case at hand, we seek a rotation matrix R such that the scalar function 


<*>(») = _rv i) 2 

i = 1 


•This section is a solution to Problem 65-1 by Grace Wahba in Reference 3- 

tThe Orbiting Astronomical Observatory has this capability, where the points are known stars. 
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is a minimum. Here, v i and w, are vectors defining a point relative to the fixed and local coor- 
dinate systems, respectively, v. and w, need not be unit vectors; their length may indicate the 
reliability of the measurement. The matrix R-thus also the function ^(R)-has only three inde- 
pendent parameters. The rotation that makes <fc( R) a minimum is a solution of the three equations 
obtained by setting the partial derivatives of <t> with respect to each independent parameter to zero 
(assuming that the derivatives do exist). The use of Eulerian angles as independent parameters 
leads to three condition equations that are complicated and tedious to derive and must be dealt with 
as three scalar equations. On the other hand, the condition equations in terms of a vector repre- 
sentation of R can be expressed as a single vector equation, easily derived and solvable by vector 
and matrix algebra. 

Since R is a rotation matrix, tf>(R) may be written as 


^(R) 


n 

[ v i 2 + w i 2 ' 2W 1 • (RV^J 


which as a function of Yo? gives 


<i(R) 


22 | V ' 2 + W ' 2 " r+V [( 1-Y2 ) V - ' W * + 2 K • Y ) ( W > • Y ) + 2(w, XV.) • y] 


(20a) 


when Y 2 < and 


^(R) = jv , 2 + W, 2 - 2 |~- V, • W. t 2(V i • X)(W i • X)]j 


(20b) 


(where x 2 i) when Y 2 
In the first case, 


= (1 + ’J 2 ) 2 £]{ 2 [( V i xW i) - Y -( V . - Y )K - Y ) -V, W,] y. 


(1+Y 2 )[(V, ' Y) w. ( + (w ; Y) v . . -(V.xWj^l , 


where the subscript j refers to the j th component of the vector. Thus a necessary condition for 
* to have a minimum (at least among the rotations of less than 180-degrees) at some Y is given 
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by the vector equation 


{z>. 


v i) ■ y + ( v . ■ y ) ( w i ■ y ) + v i 


( 1 +y2 )2I [k ■ y ) w . + ( w i • y ) v . +w . * v j 


( 21 ) 


(For convenience, the range of summation is omitted hereinafter; the letter i will denote the sum- 
mation index.) 

Taking the dot product of each side, with Y gives 

’ r 

2 ' Y ) ( W i ' Y ) = Z] [( W - XV .) • Y + 2V ; -wj Y 2 - £ (w.xV,) ■ Y . 


When this last expression is substituted in the vector equation and the factor l +Y 2 is divided out, 
we obtain 


j[(w. X Vj) • Y + 2Vj • Y - [(V, • Y) W ; + (w ; • y) V, + W; x = 0 . 


The same substitution in Equation 20a gives 


<£(R) r ( v i 2 + w , 2 " 2V. ■ W.) - 2Y ■ £ W i x v i 


( 22 ) 


(23) 


when Y satisfies Equation 22. Hence, to minimize <t> we must take the solution of Equation 22 that 
makes Y*lW i xV i a maximum. 

Equation 22 may be written in matrix notation as follows: 

(a t YI + b)y = A, (24) 

where I is the identity matrix, A is the vector ZW.x V ; , and B is a symmetric matrix, with elements 

b . . - - / { v . . vv . . + v . . w . \ , i ,/ k , 

jk / J \ I i lk lk 1 j ) * J ’ 

b. = 2 / (v * W. - v. . w . \ . 

J ) L , V » * 1 j i j; 

(To see this most easily, express Equations 22 and 24 in component form.) 
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If A = 0, then Y = 0 is the desired solution (there may be other solutions if det b = 0, but the 
value of 4> will be the same for all solutions). If A / 0, then the solutions may be obtained as out- 
lined below. Multiplying each side of the matrix equation by the adjoint of A T Yl + B gives 

[det (A t YI + B)J Y = [adj (A t YI + b)] A ; 
multiplication of this equation by A T yields the scalar equation 

[det (A T YI + B)] A t Y = A t [adj (A t YI + b) j A . 

Denote the scalar A T Y by A; denote the scalar functions det(M +B) and A T [adj(M +B)] A by f(\) and . 
g(A), respectively. Then the above scalar equation may be written as 

\f(\) - g(A) = o . 

Note that the left-hand side, say h(\), is a fourth-degree polynomial in A, and that -f(-A) is the 
characteristic polynomial of the symmetric matrix B. 

The solutions to Equation 24 are obtained by determining the zeros of h(A) and solving the re- 
sulting linear equations. However, in the discussion following Equation 23, it was determined that 
the maximum value of Y • l(w i x v 4 ) leads to the minimum value of £(Y). The maximum value of 
Y ■ £( w. xV ; ) is the largest zero of h(\>; denote this zero by A 0 . 

Since h(\) is a fourth -degree polynomial, \ 0 may be obtained analytically; however, a numeri- 
cal iterative solution is probably more practical, since the zeros of h(\> are easily bounded. Note 
that <t>( Y) is a non -negative function; therefore, Equation 23 implies that 

n 

(v, . 

i = 1 

which provides an upper bound. 

Since b is symmetric, there exists an orthogonal matrix P, say, such that P" 1 BP is diagonal. 

Let Y' and U be vectors such that Y = PY' and A = PU. Then, in terms of y' and U, Equation 24 
becomes 

[(u T Y') I +b] FY' = PU , 

and a premultiplication by P" 1 gives 

pU T Y') I +d] Y' = u , 
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where D is a diagonal matrix whose entries are merely the eigenvalues of B. (Without loss of gen 
erality, we may assume the eigenvalues to be arranged in increasing order, say 2 

Multiplying each side of the last equation by adj [(u T Y') I +d] and then by U T yields the scalar 
equation 


det(u T Y' I + D) U t Y' = U T adj(U T Y' I + D) U . 


However, u T Y' T a t Y ' ^ and D is diagonal, so this last equation may be written in the form 


K i K +X l)( A + S)( X + S) r U l 2 ( X +X j)( X + S) + l, 2 2 ( X +k l)( K + S) + U 3 2 ( A +A l)(' V + S) ■ 


The left-hand side of the above equation is xf(X) and the right-hand side is g(X). We may now 
easily determine the sign of g(X) when X - -X 3 , -X 2 , -x x , respectively (assuming \ l < x 2 <x 3 ), and 
thus also the sign of h(A ) = M(k) -g(A.) at these points. From these considerations, it can be 
shown that ^ is at least as great as -X p and also that the largest zero of g(X) (provided that g(X) 
is not identically zero, i.e., A/ o) is less than or equal to -X r Hence, a lower bound of our de- 
sired zero x o is -X , which can be determined only by solving a cubic. But the largest zero of g(X) 
is also a lower bound, and this can be found by solving a quadratic. 

Thus far, we have obtained the minimum of <£(R) for rotations of less than 180 degrees. To 
obtain the minimum among all 180-degree rotations, we use the method of Lagrange for solving 
extremum problems with a constraint. The necessary conditions become 


- x ) 


(w, • X). 


2^ x . 


1 , 2 , 3 , 


(25) 


where t± is Lagrange’s multiplier and X must be such that x 2 = l. In terms of our previous nota- 
tion, the above conditions may be collected into the single vector equation 


- 2 H v ' •»>) 


i + b : 

_i 


where B is the symmetric matrix introduced in Equation 24. This last equation can be satisfied if 
and only if /j is a root of the cubic equation 


det 



0 . 
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But the roots of this equation are given by 


2 



• W 


k = 1,2,3, 


where the \ are eigenvalues of B. The condition equations thus become 


(B~\ I) X = 0 , X 2 - 1 . 

and the solutions are the unit eigenvectors of B. The value of <£(R) at each of these solutions may 
be obtained by multiplying the jth equation of Equations 25 by Xj and adding the three equations 
together. This yields 


M ^ 4^(V. -X)(W. -X) , 


which, when substituted in the expression for <£(R) (Equation 20b), gives 


<£(R) = TM"? + 2V , ' w .) ~ ' 

= + • 


Thus the eigenvector (with unit length) corresponding to the minimum eigenvalue \ gives the min- 
imum of </>( R) for all 180-degree rotations. 

Equation 23 gives the minimum of <£(R) for rotations other than 180 degrees: 


«*> = £>, -w.) 2 - 2X 0 : 


it was also shown that x„> -X 1# Hence, the rotation giving the minimum R) among all rotations 
is 


Y - (\ Q I + b) " 1 A , when f(^ 0 ) ^ °> 
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or by a 180 -degree rotation with axis of rotation X defined by 


(\ 0 I + b) x = o , and x 2 = l , 


when f(k 0 ) = o, where is the largest zero of h(\). 

To avoid inverting a near -singular matrix and dealing with large values of the components of 
Y when f(x 0 ) is near zero, obtain the Z vector representation of R from Equation 17 and from the 
relationship 


(\ 0 i + b)- 


adj (\„ I +B) adj (^ 0 I + b) 
det(X 0 I+B) f(\ Q ) 


This gives 


Z = 


sgn 

[ f K) 


yl 

i 1 

o 

2 +A T 

^adj (A 

o I + B) 

12 

A 


when f(\) ? 0 


and 


where 


(x 0 i + b)z = 0 

and Z 2 - 1 , 

when f(x 0 ) - 0 

adj (\I +b) 

= \ 2 I + X [Tr(B) I - 

b] + adj (B) , 

f(\) 

= + Tr (B) X 2 + Tr [adj (B)j X + det(B) , 

g(M 

= A t [adj (\I + B)] A 



and Tr(B) denotes the trace of b. These last equations are easily verified by direct calculations. 

If n = 2, Vj 2 = V 2 2 , and w t 2 - W 2 2 , then the following simple procedure gives the least-squares 
solution. Let 


U 2 



Vj 


u; = 

*1 

-W 2 


Iv. 

-V 2 | ’ 


-W 2 | 


V i 

+ V 2 

u 2 ' = 


+ w 2 


iv, 

+ v 2 | ’ 

1*t 

+ w 2 | 
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and use the techniques of the preceding section to obtain the rotation that takes each U i into u/. 
Such a rotation takes the plane determined by v 2 and V 2 into the plane determined by Wj and W 2 and 
ensures that Wj ■ RV t = w 2 • RV 2 . That such a rotation is the least-squares rotation can be verified 
by expressing the V i and w. as functions of u. and U/. It is then clear that 4>( R ) is a minimum 
when U. # " RUj . 

We have also been very successful in solving Equation 21 in the cases where n> 2 , by succes- 
sive substitutions, i.e., using the iteration 

| -f y ^ n 

Y , + > " n [( V i * Y i) W i + V-.W^V.] • 

2^ [( w i“ v i) Y i + ( V , • Y i)( w , -Yj) +V , -»i] 1 = 1 

L = 1 

Here, Y j is the jth approximation to Y, and Y 0 is obtained as above using two of the V L and their 
corresponding images . In fact, in one case studied, the procedure converged even when the 
angle of rotation was as large as 179 degrees. 


RELATIONS BETWEEN EULERIAN ANGLES, MATRIX OF ROTATION, 
AND VECTOR APPROACH 


Since Eulerian angles have a wide usage (especially when the angles correspond to yaw, pitch, 
and roll) it may be convenient or necessary to transform the matrix or vector parametrization of 
a rotation into Eulerian angles. To do this, we must establish a convention or positive sense of 
rotation. Here, we assume that the matrices of the simple rotations about each of the coordinate 
axes are given by 


(*) 



* 3 <*> 



where R t (r J ) indicates a rotation of 0 about the ith axis. The matrix of any rotation R can then be 
written as 


R 


R k (• .,) r , (* 2 ) r i (-=) 


C \ T 1, 2, 3 , 
J j T 1, 2, 3 , 

I k “ 1, 2, 3 , 

Li * j t k . 
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If i, j, and k are distinct, then direct calculation shows that 

sin0 2 = S i j K r k i • cos sin 9 1 = -5 ijk r kj , 

cos 0 cos 9 - r. , , cos 9. sin #, = -S . . . r . . . 

2 1 k k 2 3 ijkji 1 

COS #. COS #, - r , 

2 3 i i 

where S.. k = l if ijk is acyclic permutation of 123, and S. jk = -l otherwise. Thus, if R is given, 
the angles are defined as follows: 



if cos & 2 / 0, 

sin 9 , - -h . . , r. . /cos 6 _ , cos #, - r. . /cos #„ , 

1 ijkkj/ 2 1 kk/ 2 

sin#, = -8 . . . r . . /cos #„ , cos#, = r . . /cos #„ ; 

3 i j k j 1 / 2 3 11 / 2’ 

if cos # 2 - o, # x and # 3 are subject only to the conditions 

Sin ( 8 3 ± *ijk®l) = S ijk r ij- 
OOS (^ 3 ± S 1 ^,) = + S ijk r ik ’ 

where the upper signs are taken if sin # 2 = l and the lower signs when sin # 2 = ~i . 

The factorization is not unique even when cos# 2 / 0, since either choice of sign for cos # 2 
produces the same product matrix R. 

To factorize r in the form 

R = R , (»*)*, (^) R i ( 0 i) 

where the first and last factors are of the same form, let 
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if sin 0 2 / 0, 


sin 6 


r . 

1 j 

i/sin 0 2 , 

cos 0 j 

- 

B . . r ., /sin 6* 
j 1 1 k / 2 ’ 

sin ff 3 


r . . 

) j 

t/ /sin 6 2 , 

cos 

= 

- 5 . . r . ,/sinfr' 

j 1 k 1 / 2 


where 5 = l if ji is in natural cyclic order, and B . . = -1 otherwise. If sin = o, 6 and £ 3 are 

subject only to the conditions 


sin (° 3 ± 0 i) = S j . r k j ’ 

cos K ± 0 l) = ±r kk , 

where the plus sign is taken if r. . > o and the minus sign if r . . < o. 

Given Eulerian angles & 19 <? 2 , as defined above, we can obtain the vector representation by 
forming the product matrix r, and then using the techniques of the previous sections. A more di- 
rect approach is to use Equations 15 or 16 twice, where 


Q 6 

Y. - tan 0 E , Z . - sin ^E. , -77 < 6 <tj , 

1 1 2 1 — 1 

are the vectors corresponding to (0), and E i is the coordinate axis about which the rotation is 
taken. (If 77 <0 < 277 , then the negatives of the above expressions for the range <6 < 0 must be 
used.) 

To obtain the Eulerian angles from the vector Z, use the expressions (derived from Equation 5) 

r ij = 2 [z. Z) +z k H - Z 2 ] , i / j , 

r . . = 1 - 2Z 2 + 2z ; 2 , 

where the upper sign is used when ij is in natural cyclic order and the lower sign otherwise. The 

angles are then obtained from the appropriate formulas above. 


EQUATIONS OF MOTION OF A RIGID BODY 

If R(t) is the matrix of a rotation that defines the orientation of a coordinate system (attached 
to a moving rigid body) relative to a fixed coordinate system, then R satisfies the matrix differen- 
tial equation 


R(t) - n(t)R(t), R(0) = I, 


(26) 
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where n is a skew-symmetric matrix such that Ov = V x^(t) for all vectors V, and is the angular 
velocity vector (Reference 4). This equation, in the form 

RR" 1 1 0, 


With R ^ R k (0 3 ) R. (fi 2 )R. gives 

. n = rr-> - R k Rj R i R f 1 Rj" 1 R^ 1 + R k R , V 1 R k ‘ 1 + R k R k _1 ( 2? ) 

Although this is a matrix equation, it represents only three independent component equations, since 
each of the product matrices on the right is skew -symmetric. These three independent equations 
can be collected into a single vector equation by means of the well-known isomorphism between 
3x3 skew-symmetric matrices and three-dimensional vectors, 


S(V) 




(28) 


It is easy to verify that if R is a rotation matrix, then RS(V)R‘ 1 *^RV. Also, from the definitions of 
, it is easy to show that Ify ( 0 ) fy” 1 ( 0)~0 E ? , where is the coordinate axis of rotation (this is 
also valid for rotations about any fixed line). For these reasons, and because Equation 27 

is equivalent to 


( 6 ’ 2 ) E . + <4 2 R k 4 ^3 E U 


A' , 


"( 0 ) = 0 , 


(29) 


where A is a matrix whose columns are the vectors R k R.E i , R k E , and E k , respectively, and 
(‘i. 'r'a) 1 - 

The matrix differential equation has no singularities, but requires the integration of nine 
scalar functions. Equation 29, on the other hand, only involves three scalar functions, but the 
matrix A is singular when cos t> 2 r o for k t i, or when sin 1 9 - 0 for k = i. Thus, no set of 
Eulerian angles can be chosen so that 0 will be defined for all rotations. In fact, any set of 
Eulerian angles gives singularities for rotations as small as 90 degrees. 

To obtain the equations of motion expressed in terms of the Y vector, we merely differentiate 
Equation 13 and make the proper substitutions using Equations 4 and 13 and the matrix differential 
equation. This gives 


[O * Y ) Y + Y x + a;] , 


Y(0) 


0 . 


(30) 
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To solve for oj, cross -multiply each side by Y, and subtract the resulting equation from the original. 
Thus, 


J (l + Y 2 ) w = Y - Y x Y. 

Differential Equation 30 has no singularities, but from the definition of Y we know that solutions 
involving 180-degree rotations will diverge to infinity. However, for many applications (involving 
only moderate displacements of the moving frame) this presents no difficulties. The differential 
equations in terms of the Z rotation vector can be obtained as they were in terms of the Y rotation 
vector. A more direct approach is to use the identity 


z 


/TTV* Y ’ 


(31) 


whence, 


1 • Y - Y 

VTT y 7 Y (l + y 2 ) i\ + y 2 Y ' 


(32) 


Equation 30 gives 


Y ‘ Y = \ (l + Y 2 ) * Y . (33) 

Combining Equations 30 through 33 gives 

Z = y [z x a; + VT “ Z 2 w] , Z(0) = 0- (34) 

Conversely, let Z(t) be a differentiable vector function such that over some interval (say, 
t 0 <t<t 1 ) z 2 < 1 and Z satisfies Equation 34. Then Z(t) defines a rotation matrix R(t) by Equa- 
tion 5, and R is given by 


R = -4Z • ZI + 2 (ZZ T + ZZ T ) + 2/l -Z 2 S(Z) - 2Z * Z (l - Z 2 )“ 1/2 S(Z) , 

where S(V) denotes the skew-symmetric matrix formed from the vector V by Equation 28. Taking 
the dot product of each side of Equation 34 with Z leads to 


Z • Z 


1 d(z 2 ) 

2 dt 


1 

2 


zii - z 2 , 
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or 


z ■ z(i - z 2 )" l/2 = p • z, (35) 

since z 2 < 1. When Equations 34 and 35 are substituted in the expression for R, then 

R = VT^Z 1 [za. T + aZ T + s(z x cu + - 2a.- Zl] + Z(Zx ^) T + (Zx a) Z T - a • ZS(Z) , 

and direct calculation will verify that r = fiR for t 0 < t < t 2 . In fact, R = OR even if z 2 - 1, provided 
that z has a derivative at this point satisfying Equation 34 and that Equation 35 is valid in the limit 
as Z 2 approaches unity. For example, if o> is a non-zero constant vector, then 



is a function satisfying Equation 34 for —n < |a.| t <v, indeed, the matrix R defined by z satisfies 
Equation 26 over the same interval. 

Thus, both Equations 30 and 34 define the motion of a rigid body over a wider range of allow- 
able orientations than Euler's equations (Equation 29) and require only the integration of three 
scalar equations (which do not contain trigonometric functions). Furthermore, the results of the 
integration (especially z) can be used directly. There is no need to generate the matrix; for ex- 
ample, we can obtain the coordinates of a vector relative to the body system and Y or Z as shown 
under "Coordinates of a Rotated Vector." If z x defines the orientation of the body at time t, rela- 
tive to the body system at t = o, and Z 2 defines the orientation at time t 2 relative to the body sys- 
tem at t = tj, then the rotation product z = Z * Z, gives the orientation at time 1 2 relative to the 
body system at t 0 for all Z t and Z 2 . Eulerian angles can also be obtained as described in the 
previous section. 

As with most differential equations, one would have to devote considerable time to Equations 
30 and 34 in order to describe completely the properties of the solutions. The equations have one 
property in common that is useful for approximating the solutions for small increments of time. 

If Y(0) ^ Z(0) = 0, then the nth derivative of Y or Z at t = 0 contains the term (0)-the 

(n- i )th derivative of a at t = 0; for the first two derivatives this is the only term. Thus, 

f h 

Y(h) = y 1 o.dt + U , 

Jo 

r ' h 

Z(h) = y a dt + V , 

Jo 
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where u and v are of the order of h 3 . Hence, to the second order, both solutions may be approxi- 
mated by the integral of the angular velocity. 


CONCLUSION 

The significant advantages of the vector approach to rotations, as presented here, over other 
parametrizations is that the vector parameters can be obtained with ease from basic data; we need 
not transform them to a new set in order to perform the algebra of rotations (the product of ro- 
tations and the product of a vector by a rotation). We need not evaluate the trigonometric func- 
tions; there are no longer the singularities that existed when we tried to write a vector in polar 
form or factor a rotation matrix into simple rotations. To illustrate these last remarks, we cite 
one final important application of our vector approach to rotations. 

In orbit theory, it is customary to obtain the components of the position and velocity vectors 
by rotating a coordinate system in which the direction cosines of the angular momentum vector 
are given by E 3 - (0. 0, 1) T into a fixed system in which the direction cosines of this vector are 
also known, say H = (hj, h 2 , h 3 ) T . We usually assume that h is of the form H = (sin i sinn, 

- sin i cosfi, cos i ) T ; thus the rotation is normally given by r = r 3 (-fi)R 1 (-i). Unfortunately, this 
technique produces a singularity even in the trivial case i - o (no rotation required). On the other 
hand, we can refer to the section "Rotations Determined by Two Vectors and Their Images," 
obtaining 


Z 


/ 2 (l +h 3) 



as the rotation vector taking e 3 into H. Thus, the two parameters and z 2 define the rotation 
uniquely for all i except i = n. 
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